Ranking-based readability assessment for early primary children's literature
نویسندگان
چکیده
Determining the reading level of children’s literature is an important task for providing educators and parents with an appropriate reading trajectory through a curriculum. Automating this process has been a challenge addressed before in the computational linguistics literature, with most studies attempting to predict the particular grade level of a text. However, guided reading levels developed by educators operate at a more fine-grained level, with multiple levels corresponding to each grade. We find that ranking performs much better than classification at the fine-grained leveling task, and that features derived from the visual layout of a book are just as predictive as standard text features of level; including both sets of features, we find that we can predict the reading level up to 83% of the time on a small corpus of children’s books.
منابع مشابه
Comparing human versus automatic feature extraction for fine-grained elementary readability assessment
Early primary children’s literature poses some interesting challenges for automated readability assessment: for example, teachers often use fine-grained reading leveling systems for determining appropriate books for children to read (many current systems approach readability assessment at a coarser whole grade level). In previous work (Ma et al., 2012), we suggested that the fine-grained assess...
متن کاملContent-Based Readability Assessment: A Study Using A Syllabic Alphabetic Language (Thai)
Text readability is typically defined in terms of “grade level”; the expected educational level of the reader at which the text is directed. Mechanisms for measuring readability in English documents are well established; however this is not in case in many other languages, such as syllabic alphabetic languages. In this paper seven different mechanisms for assessing the readability of syllabic a...
متن کاملFeatures indicating readability in Swedish text
Studies have shown that modern methods of readability assessment, using automated linguistic analysis and machine learning (ML), is a viable road forward for readability classification and ranking. In this paper we present a study of different levels of analysis and a large number of features and how they affect an ML-system’s accuracy when it comes to readability assessment. We test a large nu...
متن کاملSurveying the Experts View on the Necessity of Revision in Rating of Children and Adolescents's Books
Background and Aim: The purpose of this study was to find out the current status of non-academic rankings of children's books and survey the experts view on the revision scheme in the classification of such books. Method: The qualitative study was employed. The research tool was a questionnaire based on the research objectives. Openended interview data collection method was used based...
متن کاملAssessing the Readability of Sentences: Which Corpora and Features?
The paper investigates the problem of sentence readability assessment, which is modelled as a classification task, with a specific view to text simplification. In particular, it addresses two open issues connected with it, i.e. the corpora to be used for training, and the identification of the most effective features to determine sentence readability. An existing readability assessment tool dev...
متن کامل